CDS

Accession Number TCMCG058C21449
gbkey CDS
Protein Id KAF7143283.1
Location complement(join(5725326..5725508,5725682..5725825,5725905..5726021,5726125..5726213,5727317..5727400,5729421..5729508,5729601..5729675,5729892..5730044,5730348..5730429,5730641..5730708,5730830..5731886,5733551..5733597,5734732..5735028))
Organism Rhododendron simsii
locus_tag RHSIM_Rhsim05G0039000
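
The Location field above can be checked against the protein length reported in this record. The following is a minimal sketch, assuming GenBank-style location syntax with inclusive 1-based coordinates; the helper names `parse_location` and `cds_length` are hypothetical and not part of any library.

```python
import re

# Location string copied from the CDS record above.
LOCATION = (
    "complement(join(5725326..5725508,5725682..5725825,5725905..5726021,"
    "5726125..5726213,5727317..5727400,5729421..5729508,5729601..5729675,"
    "5729892..5730044,5730348..5730429,5730641..5730708,5730830..5731886,"
    "5733551..5733597,5734732..5735028))"
)

def parse_location(location):
    """Return (strand, [(start, end), ...]) for a simple join/complement location."""
    strand = "-" if location.startswith("complement(") else "+"
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, segments

def cds_length(segments):
    """Sum segment lengths; both coordinates are inclusive."""
    return sum(end - start + 1 for start, end in segments)

strand, segments = parse_location(LOCATION)
total = cds_length(segments)
print(strand, len(segments), total)  # - 13 2484
print(total // 3 - 1)                # 827 residues once the stop codon is excluded
```

The 13 segments sum to 2484 nt, which is 828 codons including the stop codon, consistent with the 827 aa length listed in the Protein section.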

Protein

Length 827aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA588298, BioSample:SAMN13241185
db_source WJXA01000005.1
Definition hypothetical protein RHSIM_Rhsim05G0039000 [Rhododendron simsii]
Locus_tag RHSIM_Rhsim05G0039000

EGGNOG-MAPPER Annotation

COG_category OP
Description Glutamate carboxypeptidase 2
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko01000, ko01002
KEGG_ko ko:K01301
EC 3.4.17.21
KEGG_Pathway -
GOs GO:0005575, GO:0005622, GO:0005623, GO:0005737, GO:0005773, GO:0005783, GO:0008150, GO:0009888, GO:0010073, GO:0012505, GO:0032501, GO:0032502, GO:0043226, GO:0043227, GO:0043229, GO:0043231, GO:0044424, GO:0044444, GO:0044464, GO:0048507, GO:0048856

Sequence

CDS:  
ATGGGCAGGGCCGATATCGGACTGGGTGGTGAACCCGGTGGCGCCGCGGCCCAGACCGGCCACGTAATTTGGTGGGGGCTTCGTGTTGAGGAATTCTGGCCTAGAATTGGGTTAAACGGGGCCTGCATGATGACAGGGACGTGTCGGGATTCGGGCGAGGTGTCCGTGGAGGAGTCGTCTGGATGTGGGGTCGAGGCCTCCGTGGTGGCGTCGTCTGAGGTCTCTGTGGTGGAGCCGTCTGAGGTCTCCGTTGAGGGTTTTTCATTGTCTGTGGATTTGAAGAACACCATGGTTATGGAGAAGATGACTGCCGATGCACAGTTCCGTCCAAAAATGGTGTCCCCAAGGGTGTCAGACTATTTTGTTGTCTATCCACCCACATCAGCTCCTCCTACAAAGAATCCCCTTCAACAACCACATAACTCACCCAGAATTTCTATCGGCTCAAAAATTCAAGTTCCGATGATTAAAACAGCCACCATCACCTTCATAGCCATAGCCACCTCTCTATCCATCTTCTTTTCATCTCCTTCAAAAACTTCTTATCACAACTTATTCATATCCACATCAGACAATGCCTCAATATCACAACACCTCTTCACCCTCACTCGCCGACCCCATGTTGCTGGCTCTCACGCAAACGCTGAAGCCGCAGCCTATGTGCTGTCCACTCTCGATTCATATGATATCCCCTCACACATAGCATCCTATGATGTGCTCCTCACATATCCTGTCTCGCGTTCCCTAACACTAACACGCCCACCCCCTGATTCCCCCACCACTTTTGATCTAAGCCAGGAAATCTACAAAGGGGATCCATATGCGGATGTAGCTGATGAAGTCCTACCCACTTTTCATGCATATGCAAAATCGGGTACAGTATCTGGACCGGCGGTGTATGTGAATTACGGGCGCGTGGAAGACTATGCGATATTAAAGGGAATGGGAGTGAATGTGTCTGGTACTGTTGCATTAGCAAGGTATGGAGAGATTTTTAGAGGCGACATTGTAGAAAATGCTTATGATGCAGGTGCTGTAGGTGTAGTAATCTATACAGATAGGAAGGACTATGGTGGTGGGGGAGGCGGCGCAAAGTGGTTTCCAGATGACAAGTGGATGCCACCAAGTGGAGTTCAGGTGGGATCAGTGTTCCGTGGGACAGGTGATCCTACCACTCCTGGTTGGCCCAGTACCGGGACATGTGAGAGGCTATCAGACGATGAGGTGGAGCAGAGAGATGATGTTCCACATATACCTTCGTTGCCAATATCCTGGGCAGATGGTGATGCAATCATGAGATCGATTGGAGGGCTAGTAGCAAAGGATGATTGGCAGGGAGGCATAGATGCTCCAGTTTACAGGGTTGGACCAGGACCTGGACTTATCGAGCTTAGTTACACGGGGAAGAAAGTCATTAGCACAATTGAGAATGTTATTGGCATTATCGAAGGAGGTGAAGAACCTGACCGATTTGTCATCCTGGGTAATCATCGGGATGCATGGACATTTGGAGCTGTTGATCCCAACAGTGGCACTGCAACCATGCTTGAGATAGGATCAACAGAATGGGTTGAAGAGAACAGGGAAATGATAGCTTCAAGGGTTGTTGCTTACTTGAATGTTGATTCTGCAGTACATGCAGCAGGTTTCCATGCCTCGGCAACTCCACAGCTTGATGAACTACTTAAACAAGCCACTCAACAGGTTCAGGACCCCGATAACTCATCACAGACAATCTATGAAGCTTGGGTTGGCTCCAGCAAAAGCAATGAGCCCATGATTGGAAGGTTAGGAGGTGGAGGATCAGATTATGCAGCTTTCGTACAACATATTGGTGTTCCTTCAGTTGATATGTCTTTCGGCAAAGGTTATCCAGTCTACCACTCGATGTACGATGACTTCACCTGGATGAAGAAATTTGGTGACCCTATGTTTCATAGGCATGTAGCAGCGGCAAGTGTTTGGGGTTTAGTAGCTCTAAAACTCGCAGATGAGGAACT
TTTGCCTTTTAATTATCTTTCCTATTCACATGAGCTCCAGAAAAGTGCAGAAGATTTAAAAGAGCAGGTATCAGATAAGGGCATAAACCTTGTTCCTCTGTTCAACTCTATAGAGAAGCTCAAAAGGGCAGCCACCAAAATAAATAACCAGAGAAAGGCATTGGAAGAAAATAAAGGTTGGGAATCAATTTGGAAAAAGGACCCTCAGAAGGTGAGAGAGTGGAATGACAGATTAATGATGGCAGAGCGAGCATTCATAGATCGAGATGGGCTCTCTGGAAGGCCATGGTCGAAGCATATGATTTATGCGCCTTCAAAACACAATGATTATGGATCTAAGTCCTTCCCTGGGGTTGATGATGCAATTGAAAAGGCTACGAGTCTTAACACAGCAGAGTCATGGCGTCTCGTTCAACATGAAGTTTGGAGAGTTTCTAGAGCTGTCACGCATGTATCGCTAGTACTCAATGGTAAATTGACATGA
Protein:  
MGRADIGLGGEPGGAAAQTGHVIWWGLRVEEFWPRIGLNGACMMTGTCRDSGEVSVEESSGCGVEASVVASSEVSVVEPSEVSVEGFSLSVDLKNTMVMEKMTADAQFRPKMVSPRVSDYFVVYPPTSAPPTKNPLQQPHNSPRISIGSKIQVPMIKTATITFIAIATSLSIFFSSPSKTSYHNLFISTSDNASISQHLFTLTRRPHVAGSHANAEAAAYVLSTLDSYDIPSHIASYDVLLTYPVSRSLTLTRPPPDSPTTFDLSQEIYKGDPYADVADEVLPTFHAYAKSGTVSGPAVYVNYGRVEDYAILKGMGVNVSGTVALARYGEIFRGDIVENAYDAGAVGVVIYTDRKDYGGGGGGAKWFPDDKWMPPSGVQVGSVFRGTGDPTTPGWPSTGTCERLSDDEVEQRDDVPHIPSLPISWADGDAIMRSIGGLVAKDDWQGGIDAPVYRVGPGPGLIELSYTGKKVISTIENVIGIIEGGEEPDRFVILGNHRDAWTFGAVDPNSGTATMLEIGSTEWVEENREMIASRVVAYLNVDSAVHAAGFHASATPQLDELLKQATQQVQDPDNSSQTIYEAWVGSSKSNEPMIGRLGGGGSDYAAFVQHIGVPSVDMSFGKGYPVYHSMYDDFTWMKKFGDPMFHRHVAAASVWGLVALKLADEELLPFNYLSYSHELQKSAEDLKEQVSDKGINLVPLFNSIEKLKRAATKINNQRKALEENKGWESIWKKDPQKVREWNDRLMMAERAFIDRDGLSGRPWSKHMIYAPSKHNDYGSKSFPGVDDAIEKATSLNTAESWRLVQHEVWRVSRAVTHVSLVLNGKLT
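
The correspondence between the CDS and Protein sequences above can be spot-checked by translation. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the function name `translate` is hypothetical, and only the first 15 and last 12 nucleotides of the CDS are used as excerpts here.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    return "".join(
        CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds) - len(cds) % 3, 3)
    )

# Excerpts copied from the start and end of the CDS above.
print(translate("ATGGGCAGGGCCGAT"))  # MGRAD
print(translate("AAATTGACATGA"))     # KLT*
```

Both excerpts agree with the start (MGRAD) and end (KLT) of the listed protein, with the final TGA read as the stop codon.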